NASA Overview: Mission Directorates


Vision: To reach for new heights and reveal the unknown so that 
what we do and learn will benefit all humankind 

Mission: To pioneer the future in space exploration, scientific 
discovery, and aeronautics research 

Aeronautics Research (ARMD): Pioneer and prove new flight 
technologies for safer, more secure, efficient, and environmentally 
Human Exploration and Operations (HEOMD): Focus on ISS 
Science (SMD): Explore the Earth, solar system, and universe 
Marshall Space Flight Center 
Huntsville, AL 


Stennis Space Center 
• Digital experiments and physical experiments are tradable 

• Physical systems and live tests are generally expensive and dangerous (e.g., extreme environments), require long wait times, and offer limited sensor data 

NASA collects and curates vast amounts of observational science data that require 
• NASA's goals in aeronautics, Earth and space sciences, and human and robotic exploration 
reduce cost, mitigate risk, accelerate R&D, and heighten societal impact 
NASA's Diverse HPC Requirements 


• Engineering requires HPC resources that can process large ensembles of moderate-scale 
(high throughput / capacity) 

• Research requires HPC resources that can handle 
• Time-sensitive mission-critical applications require HPC resources on demand (high availability / maintain readiness) 


Computing Systems 


Balanced HPC Environment 


• Pleiades: 212K-core SGI Altix ICE with 4 generations of Intel Xeon (4 racks GPU-enhanced: M2090, K40; 16 nodes have Phi 5110P); 723 TB RAM; 5.3 PF peak 

• Merope: 12K-core SGI Altix ICE with 2 generations of Intel Xeon; 28 TB RAM; 141 TF peak 

• Endeavour: Two SGI UV2000 nodes with 2 and 4 TB shared memory SSI via NUMALink-6; 32 TF peak 

• hyperwall: 1024-core AMD Opteron, 128-node GPU M2090 cluster for large-scale rendering & concurrent visualization 

Data Storage 

• 20 PB of RAID over several Lustre filesystems 

• 115 PB of tape archive 


Networks 
Scientists and engineers plan computational analyses, selecting the 
best-suited codes to address NASA's 

Performance Optimization 

NAS software experts 

understanding and insight, accelerated science and engineering, and increased 
mission safety and performance 


Data Analysis and Visualization 

NAS visualization experts apply advanced data analysis and 

Computational Modeling, Simulation, & Analysis 

software, networks, and storage) to rapidly solve 
• Execute many threads simultaneously at relatively lower power 
- Nvidia GPGPU: Did not get much traction within NASA 

Intel Xeon Phi (KNC) evaluation 

• 128 nodes, each with 2 Sandy Bridge and 2 KNC 
• Examine performance in four different execution modes: Native Host, Off-load, Symmetric, Native MIC 
• Micro-kernel benchmarks: Memory bandwidth / latency, MPI functions, OpenMP constructs 
• NAS Parallel Benchmarks (NPB): OpenMP, MPI, MPI+OpenMP 
• Applications: OVERFLOW, Cart3D, WRF 
• Results reported without extensive code modifications 


Summary Performance Results 

• System stability initially an issue but situation improved as MPSS (Many-core 
• Running codes in Native modes leads to wasted resources 
• Hybrid code in Symmetric mode yields best performance due to reduced MPI 
• KNC not ready for prime time, but next generation KNL looks promising due to no host and several other architectural improvements 

Extensive details in SC2013 paper by S. Saini et al.: "An early performance evaluation of many integrated core architecture based SGI rackable computing 

Big Data 
• Observational Data 
  - Tens of satellites and telescopes producing multi-petabytes of data per year 
  - SMD's Earth Science Division operates 12 DAACs (archive centers) containing ~10 PB of data 
  - Solar Dynamics Observatory (SDO) satellite produces 1 GB per minute; translates to ~3 PB over its 5-year life cycle 

• Model / Simulation Data 
  - NAS Division has 20 PB of unique data in global filesystems and 115 PB of archive storage 
  - MITgcm code running at 1/48th-degree resolution on 35K cores produced 1.4 PB during its 5-day run; full simulation will produce 9-18 PB 
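
The data volumes quoted on this slide can be sanity-checked with simple arithmetic. The sketch below is only a back-of-the-envelope check, assuming decimal units (1 PB = 1,000,000 GB) and 365-day years; the function names are illustrative and not part of any NASA tool.

```python
# Back-of-the-envelope check of the data volumes quoted on the slide.
# Assumptions (not from the source): decimal units and 365-day years.

MINUTES_PER_DAY = 24 * 60
GB_PER_PB = 1_000_000


def sdo_volume_pb(gb_per_minute: float = 1.0, years: float = 5.0) -> float:
    """Total volume in PB for a steady stream of gb_per_minute over `years`."""
    minutes = years * 365 * MINUTES_PER_DAY
    return gb_per_minute * minutes / GB_PER_PB


def daily_rate_tb(total_pb: float, days: float) -> float:
    """Average output in TB per day for a run producing total_pb in `days`."""
    return total_pb * 1000 / days


if __name__ == "__main__":
    print(f"SDO over 5 years: {sdo_volume_pb():.2f} PB")
    print(f"Simulation output: {daily_rate_tb(1.4, 5):.0f} TB/day")
```

Under these assumptions, 1 GB per minute comes to about 2.6 PB over five years, which matches the slide's "~3 PB" after rounding, and the 1.4 PB five-day run averages 280 TB per day.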


DISE (Data Intensive Supercomputing Environment) integrates Big Data 
Fun Fact: The term "Big Data" was first used by Michael Cox & David Ellsworth of NAS Division in a Visualization '97 paper: 
"Visualizing flow around an airframe", where largest dataset considered was 7.5 GB 
Big Data Challenges for Users 

Conducted survey of NASA projects dealing with Big Data to gather user requirements 

• Data Discovery: Finding what data is available and where 
• Data Management: Transferring very 
computational resources 
• Tools / Models / Algorithms: Developing analysis & analytics software at scale 
• Analysis Workflow: Handling 
• Analysis / Analytics Infrastructure: 
heterogeneous resources 
• Collaboration Environments: Difficulty with sharing knowledge across a wider 
VISION 

To provide "Science as a Service" to the 
global environmental challenges 

GOAL 

To improve efficiency and expand the scope of NASA Earth science technology, research, and applications programs 


NEX Environment 

NASA EOSDIS Distributed Active Archive Centers 

Collaboration Portal (over 400 members): Tools, Models, Workflows, Papers, Data 
Compute Resources (over 5 PF computing): Sandbox for experimentation, HPC (Pleiades) 
Data Repository (over 2 PB of data): Working copies of Observational and Project data 
Sandbox currently available to a subset of scientists with NASA credentials 
HPC resources available only to approved projects with allocation 
OpenNEX, a collaboration with Amazon, provides NEX datasets to the wider 
~800 m grid resolution 

Spring (March-May), 1950-2099 

Mean temperature projected to increase from 12°C to 15°C assuming greenhouse gas 

Area at or below 0°C isotherm decreases from 2.5M sq. km to 0.6M sq. km 

Details in paper by B. Thrasher et al.: "Downscaled climate projections suitable 
for resource management," Eos, Vol. 94, No. 37, Sept. 2013, pp. 321-323 
• Quantum mechanics deals with physical phenomena at very small scales (~100 nm) or at very low temperatures (few K) where actions are quantized 

• The outcome of a quantum experiment is probabilistically associated both with what 

• N-bit string of unknown variables {z} 
• H_i: Hamiltonian with known ground state 
• H_p: Hamiltonian whose ground state represents 
A(t) is slowly (adiabatically) lowered to zero while 
• Solution is the configuration {z} that produces the 
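
The interpolation between the two Hamiltonians can be illustrated numerically for a handful of bits. The sketch below is a toy model, not the D-Wave hardware schedule: it assumes a linear schedule A(s) = 1 - s and B(s) = s (the slide's statement about the second coefficient is cut off), a transverse-field H_i, and a hypothetical cost function `toy_cost` whose unique minimum is the bit string 101.

```python
import numpy as np


def initial_hamiltonian(n):
    """H_i = -sum_k X_k; its ground state (uniform superposition) is known."""
    x = np.array([[0.0, 1.0], [1.0, 0.0]])
    eye = np.eye(2)
    h = np.zeros((2**n, 2**n))
    for k in range(n):
        term = np.array([[1.0]])
        for j in range(n):
            term = np.kron(term, x if j == k else eye)
        h -= term
    return h


def problem_hamiltonian(cost, n):
    """Diagonal H_p: the energy of basis state z is cost(z)."""
    energies = []
    for idx in range(2**n):
        z = [(idx >> (n - 1 - b)) & 1 for b in range(n)]
        energies.append(float(cost(z)))
    return np.diag(energies)


def minimum_gap(h_i, h_p, steps=101):
    """Smallest gap between the two lowest levels of (1-s)*H_i + s*H_p."""
    gaps = []
    for s in np.linspace(0.0, 1.0, steps):
        evals = np.linalg.eigvalsh((1.0 - s) * h_i + s * h_p)
        gaps.append(evals[1] - evals[0])
    return min(gaps)


def final_configuration(h_p, n):
    """Bit string {z} with the lowest energy of H_p (the state at s = 1)."""
    idx = int(np.argmin(np.diag(h_p)))
    return [(idx >> (n - 1 - b)) & 1 for b in range(n)]


def toy_cost(z):
    """Hypothetical cost: Hamming distance to the target string 1, 0, 1."""
    target = [1, 0, 1]
    return sum(a != b for a, b in zip(z, target))


if __name__ == "__main__":
    n = 3
    h_i = initial_hamiltonian(n)
    h_p = problem_hamiltonian(toy_cost, n)
    print("minimum gap:", minimum_gap(h_i, h_p))
    print("solution z:", final_configuration(h_p, n))
```

The minimum gap matters because the adiabatic condition ties the allowed sweep speed to it: the smaller the gap along the path, the more slowly the schedule has to be traversed for the system to stay in the ground state.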
NASA and Quantum Computing 


~6 days prior to detection via standard techniques 
D-Wave Two System 

Collaboration among NASA, Google, and USRA led to installation of system at NAS Division 

• 512-qubit Vesuvius processor (to be continuously upgraded over the next 4+ years) 
• 10 kg of metal in vacuum at 15 mK 
1 nanoTesla 
Single run takes 20 μs 
• Uses 12 kW of electrical power 


Focused on solving discrete 

Programming the D-Wave Two 

No general algorithms, smart mathematical tricks (penalty functions, locality reduction, ...) 
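
As an illustration of the penalty-function trick mentioned above, the sketch below folds a "choose exactly one" constraint into a quadratic binary objective (QUBO) and verifies the result by brute force. It is a minimal example under stated assumptions: the weights, the penalty strength, and all function names are hypothetical and do not come from the slides or from any D-Wave API.

```python
from itertools import product


def one_hot_qubo(weights, penalty):
    """QUBO for: minimize sum(w_i * x_i) subject to sum(x_i) == 1.

    The constraint is added as penalty * (sum(x_i) - 1)**2. With x_i**2 == x_i
    this expands to -penalty * sum(x_i) + 2 * penalty * sum_{i<j} x_i x_j + penalty.
    Returns the coefficient dict and the constant offset.
    """
    n = len(weights)
    q = {}
    for i in range(n):
        q[(i, i)] = weights[i] - penalty
        for j in range(i + 1, n):
            q[(i, j)] = 2 * penalty
    return q, penalty


def energy(q, x, offset=0.0):
    """Evaluate the QUBO objective for the binary assignment x."""
    return offset + sum(c * x[i] * x[j] for (i, j), c in q.items())


def brute_force_min(q, n, offset=0.0):
    """Exhaustively find the lowest-energy assignment (small n only)."""
    return min(product((0, 1), repeat=n), key=lambda x: energy(q, x, offset))


if __name__ == "__main__":
    q, offset = one_hot_qubo([3.0, 1.0, 2.0], penalty=10.0)
    best = brute_force_min(q, 3, offset)
    print(best, energy(q, best, offset))
```

The penalty has to be large enough that violating the constraint always costs more than any gain in the original objective; here a value of 10 comfortably exceeds the spread of the example weights.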
for random spin-glass models 

The D-Wave hardware qubit connectivity is a "Chimera Graph", so embedding methods are mostly based on heuristics 

Note: D-Wave provides a heuristic blackbox compiler that bypasses embedding 

Embedding not needed for native Chimera problems 
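
To make the connectivity constraint concrete, the sketch below builds an idealized Chimera graph for a 512-qubit layout: an 8 x 8 grid of unit cells, each a complete bipartite K4,4. The qubit numbering and the choice of which side of a cell couples vertically are assumptions made for illustration, and a real chip may have inoperable qubits that this ideal graph ignores.

```python
def chimera_edges(rows=8, cols=8, shore=4):
    """Couplers of an ideal Chimera graph with rows x cols K_{shore,shore} cells."""

    def qubit(r, c, side, k):
        # Hypothetical linear index: cell (r, c), side 0 or 1, position k.
        return ((r * cols + c) * 2 + side) * shore + k

    edges = set()
    for r in range(rows):
        for c in range(cols):
            # Inside a cell, every side-0 qubit couples to every side-1 qubit.
            for a in range(shore):
                for b in range(shore):
                    edges.add((qubit(r, c, 0, a), qubit(r, c, 1, b)))
            for k in range(shore):
                # Side-0 qubits couple to the cell below, side-1 to the cell
                # on the right (an assumed orientation).
                if r + 1 < rows:
                    edges.add((qubit(r, c, 0, k), qubit(r + 1, c, 0, k)))
                if c + 1 < cols:
                    edges.add((qubit(r, c, 1, k), qubit(r, c + 1, 1, k)))
    return edges


def is_native(problem_couplings, hardware_edges):
    """True if every problem coupling already exists as a hardware coupler."""
    return all(
        (a, b) in hardware_edges or (b, a) in hardware_edges
        for a, b in problem_couplings
    )


if __name__ == "__main__":
    hw = chimera_edges()
    qubits = {q for edge in hw for q in edge}
    print(len(qubits), "qubits,", len(hw), "couplers")
    print(is_native([(0, 4), (0, 5)], hw))  # couplings inside one cell
    print(is_native([(0, 1)], hw))          # two qubits on the same side
```

A problem whose couplings all pass `is_native` can be placed on the hardware directly; any other problem needs an embedding step that represents one logical variable by a chain of physical qubits.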


Use symmetries, permutations, and error correction to eliminate the systemic hardware errors and check the solutions 

Performance can be improved dramatically with smart pre-/post-processing 
Advanced Computing Mission 

Enable the science & engineering required to meet NASA's missions and goals 

Effective, stable, production-level HPC environment 
Advanced technologies to meet future goals 